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Abstract 

Random walks defined on undirected graphs assign the absolute scores 
to all nodes based on the quality of path they provide for random walk- 
ers. In city space syntax, the notion of segregation acquires a statistical 
interpretation with respect to random walks. We analyze the spatial net- 
work of Venetian canals and detect its most segregated part which can be 
identified with canals adjacent to the Ghetto of Venice. 
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1 Introduction 

The spatial network of Venice that stretches across 122 small is- 
lands between which the canals serve the function of roads is consti- 
tuted by 96 canals. In March, 1516 the Government of the Serenis- 
sima Repubblica issued special laws, and the first Ghetto of Europe 
was instituted. It was an area where Jews were forced to live and 
which they could not leave from sunset to dawn. The Ghetto existed 



for more than two and a half centuries, until Napoleon conquered 
Venice and finally opened and eliminated every gate (1797). 

The phenomenon of clustering of minorities, especially that of 
newly arrived immigrants, is well documented pQ (the reference ap- 
pears in [2]). Clustering is considering to be beneficial for mutual 
support and for the sustenance of cultural and religious activities. 
At the same time, clustering and the subsequent physical segrega- 
tion of minority groups would cause their economic marginalization. 
The spatial analysis of the immigrant quarters [2] and the study of 
London's changes over 100 years |3] shows that they were signifi- 
cantly more segregated from the neighboring areas, in particular, 
the number of street turning away from the quarters to the city cen- 
ters were found to be less than in the other inner-city areas being 
usually socially barricaded by the railways, canals and industries. 

It has been suggested ^ that space structure and its impact on 
movement are critical to the link between the built environment and 
its social functioning. Spatial structures creating a local situation 
in which there is no relation between movements inside the spatial 
pattern and outside it and the lack of natural space occupancy be- 
come associated with the social misuse of the structurally abandoned 
spaces. It is well known that the urban layout effects on the spatial 
distribution of crime [S]. It has been noted that crime seems to be 
highest where the urban grid is most broken up (in effect creating 
most local segregation), and lowest where the lines are longest, and 
in fact most integrated. The linear routes through the estate have 
least burglary, and the most broken up, locally enclosed spaces the 
most [1]. 

In the present paper, we analyze the integration/segregation statis- 
tics of the Venetian canal network by means of random walks. In the 
forthcoming section (Sec. [2]), we discuss the role of city space syn- 
tax for attaining traffic equilibrium in the city transport network. 
Then, in Sec. El we show that random walks establish the Euclidean 
space structure on undirected graph, in which distances and angles 
have the clear statistical interpretations well known in quantitative 
theory of random walks. In Sees. [Hand [5], we analyze Venetian space 
syntax and detect the Ghetto of Venice as the most segregated part 
of the city canal network. Then, we conclude in the last section. 
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2 City space syntax 



Studies of urban networks have a long history. In most of re- 
searches devoted to the improving of transport routes, the opti- 
mization of power grids, and the pedestrian movement surveys the 
relationships between certain components of the urban texture are 
often measured along streets and routes considered as edges of a 
planar graph, while the traffic end points and street junctions are 
treated as nodes. Such a primary graph representation of urban 
networks is grounded on relations between junctions through the 
segments of streets. The usual city map based on Euclidean geom- 
etry can be considered as an example of primary city graphs. 

The notion of traffic equilibrium had been introduced by J.G. 
Wardrop in [7j and then generalized in [S] to a fundamental con- 
cept of network equilibrium. Given a connected undirected graph 
G{V,E), in which V is the set of nodes and E is the set of edges, 
we can define the traffic volume f : E {0,oo[ through every edge 
e E E. It then follows from the Perron-Frobenius theorem that the 
linear equation 



has a unique positive solution /(e) > 0, for every edge e & E, for 
a fixed positive constant h > and a chosen set of positive metric 
length distances i{e) > 0. This solution is naturally identified with 
the traffic equilibrium state of the transport network defined on G, 
in which the permeability of edges depends upon their lengths. The 
parameter h is called the volume entropy of the graph G, while the 
volume of G is defined as the sum Vol(G') = | J2e&E ^(^)- 

The degree of a node v E V is the number of its neighbors in G, 
deg(f) = k^. It has been shown in (BJ that among all undirected 
connected graphs of normalized volume, Vo^G) = 1, which are not 
cycles and fc^ 7^ 1 for all nodes, the minimal possible value of the 
volume entropy. 
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is attained for the length distances 

„, X _ log((fci(e) - l) (fct(e) - l)) 



(3) 
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where i[e) G V and tie) G V are the initial and terminal vertices of 
the edge e E E respectively. It is then obvious that substituting ([2]) 
and ([3]) into ([1]) the operator exp {—hi{e)) is given by a symmetric 
Markov transition operator, 

/(e) = ^ ,^ (4) 



y (/i;i(e) - l) {kt(e) - l) 

which rather describes time reversible random walks over edges than 
over nodes. The flows satisfying with the operator (jl]) meet 
the mass conservation property, Yliir^j fij — ^i; Ylij^v^i ~ '^^^ 
Eq.Q unveils the indispensable role Markov's chains defined on 
edges play in equilibrium traffic modelling and exposes the degrees 
of nodes as a key determinant of the transport networks properties. 

Wardrop's traffic equilibrium |7] is strongly tied to the human 
apprehension of space since it is required that all travellers have 
enough knowledge of the transport network they use. The human 
perception of places is not an entirely Euclidean one, but are rather 
related to the perceiving of the vista spaces (streets and squares) as 
single units and of the understanding of the topological relationships 
between these vista spaces, [9]. Decomposition of city space into a 
complete set of intersecting vista spaces produces a spatial network 
which we call the dual graph representation of a city. Therein, the 
relations between streets treated as nodes are traced through their 
junctions considered as edges. 

Dual city graphs are extensively investigated within the concept 
of space syntax, a theory developed in the late 1970s, that seeks to 
reveal the mutual effects of complex spatial urban networks on soci- 
ety and vice versa, [10l|TT]. Spatial perception that shapes peoples 
understanding of how a place is organized determines eventually the 
pattern of local movement which is predicted by the space syntax 
method with surprising accuracy [12]. The robustness of agreement 
between centrality of spaces and rush hour movement rates is now 
supported by a number of similar studies of pedestrian movement 
in different parts of the world and in an everyday commercial work 
of the Space Syntax Ltd., 
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3 Study of undirected graphs by means of ran- 
dom walks 

The issues of global connectivity of finite graphs and accessibil- 
ity of their nodes have always been the classical fields of researches 
in graph theory. Any graph representation naturally arises as an 
outcome of categorization, when we abstract a real world system 
by eliminating all but one of its features and by the grouping of 
things (or places) sharing a common attribute by classes or cate- 
gories. For instance, the common attribute of all open spaces in 
city space syntax is that we can move through them. All elements 
called nodes that fall into one and the same group V are consid- 
ered as essentially identical; permutations of them within the group 
are of no consequence. The symmetric group Sat consisting of all 
permutations of elements (A^ being the cardinality of the set V) 
constitute the symmetry group of V. If we denote hj E <0 V x V 
the set of ordered pairs of nodes called edges, then a graph is a map 
G{V, E) : E K CRj^ (we suppose that the graph has no multiple 
edges) . 

The nodes of G(y, E) are weighted with respect to some measure 
m = ^jgy?7ii(5j, specified by a set of positive numbers rrii > 0. 
The space i'^{m) of square-assumable functions with respect to the 
measure m is the Hilhert space H (a complete inner product space). 
Among all linear operators defined on 7i, those invariant under the 
permutations of nodes are of essential interest since they reflect the 
symmetry of the graph. Although there are infinitely many such 
operators, only those which maintain conservation of a quantity may 
describe a physical process. The Markov transition operators which 
share the property of probability conservation considered in theory 
of random walks on graphs are among them. Laplace operators 
describing diffusions on graphs meet the mean value property {mass 
conservation); they give another example [H] studied in spectral 
graph theory. 

Being defined on connected undirected graphs, a Markov transi- 
tion operator T has a unique equilibrium state vr (a stationary dis- 
tribution of random walks) such that ttT = vr and tt = lim^^oo crT* 
for any density a E H {cTi > 0, J^iev'^i ~ There is a unique 
measure m,r = Xliey^**^* related to the stationary distribution vr 
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with respect to which the Markov operator T is self-adjoint^ 

where is the adjoint operator. The orthonormal ordered set of 
real eigenvectors ipi, i = 1 . . . N, of the symmetric operator T de- 
fines a basis in Ti. In quantitative theory of random walks defined 
on graphs [161 [IS] and in spectral graph theory [17], the proper- 
ties of graphs are studied in connection with the eigenvalues and 
eigenvectors of self-adjoint operators defined on them. In particu- 
lar, the symmetric transition operator defined on undirected graphs 
is Tij = Xj ^fk^j. Its first eigenvector belonging to the largest 
eigenvalue /xi = 1, 

describes the local property of nodes (connectivity), tTj = ki/2M^ 
where 2M = Xliev^*' '^hile the remaining eigenvectors {ips}^^2 
belonging to the eigenvalues 1 > > . . . hn > —1 describe the 
global connectedness of the graph. 

Markov's symmetric transition operator T defines a projection of 
any density a G 7i on the eigenvector ipi of the stationary distribu- 
tion TT, 

af=iJi + a^f, a^ = (T-Vi, (7) 

in which a-^ is the vector belonging to the orthogonal complement 
of ipi- Thus, it is clear that any two densities a,p E H differ 
with respect to random walks only by their dynamical components, 
(cr — p)T* = (o"-'" — p^)T^ for alH > 0. Therefore, we can define a 
distance between any two densities which they acquire with respect 
to random walks by 

II ii2 

W^- pWt 

or, in the spectral form, 

N 

t>0 s=2 



= J2{--P\T'\--P)- (8) 

t>o 

- pWs){'^s\(y - p) = 2_^ , 

^=2 ^ 

(9) 
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where we have used Diracs bra-ket notations especially convenient 
in working with inner products and rank-one operators in Hilbert 
space. 

If we introduce in Ti.{V) a new inner product by 

{^^P)t = 2^2^ f^- (10) 

for all cr,p E 'H^V), then ([9]) is nothing else as 

Ik-Pllr = lkllT+ II P IIt- 2 (o-,p)y, (11) 

where 



||2 

a 



T 



j2 (q^l^«)(^5k) ^^2) 



being the squared norm of a G Ti-lV) with respect to random walks. 
We finish the description of the (A^ — l)-dimensional Euclidean space 
structure associated to random walks by mentioning that given two 
densities cr, p E Ti.iV), the angle between them can be introduced 
in the standard way, 

cosZ{p,a) = - — - — . (13) 

II ^ IIt II p IIt 

Random walks embed connected undirected graphs into Euclidean 
space. This embedding can be used in order to compare nodes and 
to retrace the optimal coarse-graining representations. 

Namely, let us consider the density 5, which equals 1 at the node 

— 1/2 

i E V and zero for all other nodes. It takes form Vi = tt- 6i with 
respect to the measure m^. Then, the squared norm of Vi is given 
by 

where i/jg^i is the i^'^-component of the eigenvector ipg. In theory of 
random walks [TB] , the quantity f|T^ expresses the access time to a 
target node quantifying the expected number of steps required for 
a random walker to reach the node i E V starting from an arbitrary 
node chosen randomly among all other nodes with respect to the 
stationary distribution vr. 
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The Euclidean distance between any two nodes of the graph G 
estabhshed by random walks, 



II - II: 



(15) 



is known as the commute time in theory of random walks and equals 
to the expected number of steps required for a random walker start- 
ing at z G V to visit j & V and then to return to i again, [IGJ . The 
expected number of steps a random walker reaches j if starts from 
i is called the first hitting time (or the access time) [16], 



The cosine of an angle calculated in accordance to f|T3|) has the 
structure of Pearson's coefficient of linear correlations that reveals 
it's natural statistical interpretation. Correlation properties of flows 
of random walkers passing by different paths have been remained 
beyond the scope of previous studies devoted to complex networks 
and random walks on graphs. The notion of angle between any two 
nodes in the graph arises naturally as soon as we become interested 
in the strength and direction of a linear relationship between two 
random variables, the flows of random walks moving through them. 
If the cosine of an angle (fT3l) is 1 (zero angles) , there is an increasing 
linear relationship between the flows of random walks through both 
nodes. Otherwise, if it is close to -1 (vr angle), there is a decreasing 
linear relationship. The correlation is (7r/2 angle) if the variables 
are linearly independent. It is important to mention that as usual 
the correlation between nodes does not necessary imply a direct 
causal relationship (an immediate connection) between them. 

In the forthcoming sections, we study the average of the first 
access time Hij f|T6|) with respect to its first and second indices. 

4 Detection of ghettos by access to a target node 

The Euclidean structure introduced on undirected graphs by ran- 
dom walks can be used in order to investigate city space syntax. 
Various properties of access times have been recently studied in 
concern with the traffic fiow forecasting [IB], in order to model the 




) 



(16) 
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wireless terminal movements in a cellular wireless network [19] , in a 
statistical test for the presence of a random walk component in the 
repeat sales price index models in house prices [20], in the growth 
modelling of urban conglomerations [2T], and in many other works 
where random walks have been considered directly on the city maps 
and physical landscapes. In contrast to all previous studies, we use 
random walks in order to investigate the morphology of urban tex- 
tures described by dual city graphs. In the context of traditional 
space syntax studies, the value of access time (the Euclidean norm, 
f|T^ ) calculated for a node of a dual graph can be naturally inter- 
preted as the expected number of random elementary navigation 
actions (i.e., the random turns at the junctions between axial lines) 
required to reach the certain open space in the city. 

The access to a target node introduced in theory of random walks 
[IB] is the important global characteristic of the node which is equal 
to the average of Hji (fT6|) over its first index. It quantifies the ex- 
pected number of steps required for a random walker to reach the 
node starting from an arbitrary node chosen randomly among all 
other nodes of the network with respect to the stationary distribu- 
tion vr, 

jeV * k=2 

Being a global characteristic of a node in the graph, the access ( [T7|) 
assigns the absolute scores to all nodes in the network based on 
the quality of path they provide for random walks. In urban spa- 
tial networks, the value given by (|T71) varies strongly from one open 
space to another: the norm of a space (street, square, or canal) that 
can be easily reached (just in a few random syntactic steps) from 
any other space in the city is minimal, while it could be very large 
for a statistically segregated street. The relatively isolated groups of 
nodes and bottlenecks can be visually detected from distribution his- 
tograms of the Euclidean norms (see Fig. [1]). In particular, it would 
facilitate the detection of ghettos and urban spraw0. In Fig. [H we 
have presented the histogram of empirical distribution of mean ac- 
cess times for the dual graph of Venetian canals. It is interesting to 
note that the utmost peak on the histogram correspondent from 250 
to 300 random syntactic steps represents the canals of the Cannare- 

^ Urban sprawl is the spreading out of a city and its suburbs at the fringe of an urban area. 
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Figure 1: Access spectrum of Venice canals. Tlie iiistogram of norms acquired by tlie 
Venetian canals with respect to random walks. 



gio district surrounding the quarter of Venetian Ghetto. From 16* 
century, the quarter had been enlarged later to cover the neigh- 
boring Ghetto Vecchio and the Ghetto Nuovissimo. As a result a 
specific Ghetto canal subnetwork arose in Venice weakly connected 
to the main canals. The norm (1141) which a node acquires in regard 
to random walks is a natural statistical centrality measure of the 
vertex within a graph. Probably, the most important message of 
space syntax theory is that the local property (connectivity) of city 
spaces (streets and squares) and their global configuration variable 
(centrality) are positively related in a city, and this part-whole rela- 
tionship known as intelligibility [IT| is a key determinant of human 
behaviors in urban environments. An adequate level of intelligibility 
has been found to encourage peoples way-finding abilities. Intelligi- 
bility of Venetian canal network reveals itself quantitatively in the 
scaling of the norms of nodes with connectivity shown in Fig. [2l The 
three utmost points in the left upper part of the graph displayed on 
Fig. [2] represent the most intelligible structure in the Venetian canal 
network formed by the Venetian Lagoon, the Giudecca canal, and 
the Grand Canal. The four points characterized by the worse ac- 
cessibility levels delineate the Ghetto canal subnetwork segregated 
statistically from the rest of the network. 
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Figure 2: The scatter plot of connectivity vs. the norm of a node in the dual graph 
representation of 96 Venetian canals. The slope of the regression line equals 2.07. 

5 Random target access time 

The average of access time with respect to its second index, the 
random target access time [IB], determines the expected number of 
steps a random walker needs to reach an arbitrary node of the graph 
chosen randomly from the stationary distribution vr (if a random 
walk starts in i G V). In contrast to f|T7|) . the value of this average 
is independent of the starting node i & V being a global spectral 
characteristic of a graph, 



The latter equation expresses the so called random target identity, 
[16] . Computations of random target access times f|T8|) for the dual 
city graphs show that they are closely tied to the network size (i.e., 
the numbers of junctions between different streets and canals in a 
city) (see Fig. [3]). The diagram shown on Fig. [3]have been calculated 
for the dual graph representations of two German organic medieval 
cities founded shortly after the Crusades and developed within the 
medieval fortresses (Rothenburg ob der Tauber in Bavaria and the 
downtown of Bielefeld, nowadays the biggest city of Eastern West- 
phalia), the street grid in Manhattan (a borough of New York City 
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Figure 3: The comparative diagram of the random target access times and the sizes 
of dual graphs for five compact urban patterns. Heights of left pillars correspond 
to the number of nodes in the dual city graphs. Pillars on right show the random 
target access times, the expected number of steps a random walker needs to hit a node 
randomly chosen from the stationary distribution tt. 



with an almost regular grid-like city plan), and two city canal net- 
works, in Amsterdam (binding to the delta of the Amstel river, form- 
ing a dense canal web exhibiting a high degree of radial symmetry) 
and in Venice (that stretches across 122 small islands between which 
the canals serve the function of roads), within the same frame. 

6 Conclusion 

The properties of random walks defined on the dual graph rep- 
resentation of a transport network is related to the unique equilib- 
rium configuration of not random commodity flows along edges of its 
primary graph representation that unveils the role Markov's chain 
processes and space syntax approach play in the successful traffic 
modelling. The amazing effectiveness of random walk models in de- 
scribing complex cooperative phenomena has been often discussed 
in literature, [22]. In space syntax theory, the concept of random 
walks (implicitly reckoned by means of integration measures) has 
been applied to the "probabilistic" analysis of spatial configurations 
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and proved its power by the surprisingly accurate predictions of hu- 
man behavior in cities [12]. 

We have demonstrated that random walks establish the Euclidean 
space structure on undirected graphs, in which the notions of dis- 
tance and angle acquire the clear statistical interpretations. In par- 
ticular, every open space (a street, a square, or a canal) acquires 
a norm based on the quality of path it provides for random walks. 
This norm can be used as a measure of statistical segregation the 
node is characterized in the given network. 

In the present paper, we have canalized space syntax of Vene- 
tian's canals by means of random walks defined on the dual graph 
representation of the network and detect the segregated part that is 
identified with the specific canal subnetwork of Venetian Ghetto. 
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